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Abstract — We investigate the consensus problem in a net- 
work where nodes communicate via diffusion-based molecular 
communication (DbMC). In DbMC, messages are conveyed via 
the variation in the concentration of molecules in the medium. 
Every node acquires sensory information about the environment. 
Communication enables the nodes to reach the best estimate for 
that measurement, e.g., the average of the initial estimates by 
all nodes. We consider an iterative method for communication 
among nodes that enables information spreading and averaging 
in the network. We show that the consensus can be attained after 
a finite number of iterations and variance of estimates of nodes 
can be made arbitrarily small via communication. 

Index Terms — Molecular Communication, Diffusion, Consen- 
sus, Information networks, Distributed averaging 

I. Introduction 

There has been numerous evidence of the existence of a 
form of communication using molecules in nature. At the 
microorganism scale, molecular signals are used for communi- 
cation and control among cells in living tissues. Communica- 
tion enables single cells to process sensory information about 
their environment (in a way similar to neural networks) and 
evaluate and react to chemical stimuli. The use of chemical 
signaling by living cells has been under extensive study. For 
example, it is known that some bacteria use a process named 
"Quorum Sensing" to estimate the density of their kind in 
the environment [l]-[6]. Quorum Sensing is a decentralized 
coordination process which allows bacteria to estimate the 
density of their population and regulate their behavior accord- 
ing to the estimated density. To estimate the local population 
density, bacteria release specific signaling molecules. These 
molecules are subject to diffusion process that would make the 
molecules drift away instead of accumulating in the bacteria 
vicinity. Therefore, the concentration near the bacteria tends 
to the average concentration in the medium. As the local 
density of bacteria increases, so will the density of molecules 
in the medium. Bacteria have molecules receptors that can 
estimate the molecules density and thus the bacteria population 
density. Bacteria use quorum sensing to coordinate actions 
(mostly energy expensive) that cannot be carried out by a 
single bacterium. This phenomenon, captures most of the 
important components of a communication system in micro- 
scale. Arguably, the most dominating form of communication 
at the scale of microorganisms is Diffusion based Molecular 
Communication (DbMC), i.e., embedding the information in 
the alteration of the concentration of the molecules and rely 
on diffusion to transfer the information. 



Despite its importance, study of DbMC is still in infancy. 
Very recently, new theoretical frameworks are being developed 
for molecular communication [7]-[9]. Broadly speaking, the 
purpose of understanding DbMC is three-fold. First, in the 
field of system biology. For example, the new field of system 
biology enables us to engineer microorganisms for certain ob- 
jectives [10]-[12]. Our study can help to model and manipulate 
biological networks comprised of such engineered microor- 
ganisms. Secondly, recently, there is a new trend of designing 
micro- scale networks of nano- scale devices to perform tasks 
similar to their biological counterparts [13]. There is a large 
number of applications that such networks could apply to. One 
may envision molecular based networks built using these nano- 
scale devices that can be deployed over or inside the human 
body to monitor glucose, sodium, and cholesterol levels, to 
detect the presence of different infectious agents, or to identify 
specific types of cancer. Such networks will also enable new 
smart drug administrative systems to release specific drugs 
inside the body with great accuracy and in a timely manner. 
These networks are to operate in the environments similar 
to those of bacteria and other living organisms. Hence, the 
same principles hold for these networks and DbMC is the 
most promising form of communication in these bio-inspired 
networks. There is no need for complex computation in this 
type of communication which has been performed by bacteria 
for millions of years. Therefore, it can be done by primitive 
nano- scale devices that have limited computation capabilities. 
Third, understanding DbMC can help to find out whether there 
is any optimality in natural complex bio-systems. 

In this work, we study the consensus problem in a network 
governed by DbMC. In a general consensus problem, nodes 
in a network communicate with each other to obtain the best 
estimate given their initial estimates. Average value of these 
initial estimates is considered to be an important measure that 
can be considered as a goal in a network. In this work, we 
capture the situation where a network of nodes (agents) must 
achieve a consistent opinion through information exchanges 
via molecular signaling with their neighbors. 

The key element of this type of the consensus problem is 
diffusion. Diffusion describes the spread of particles through 
random motion from regions of higher concentration to regions 
of lower concentration. Every node has the capability of 
sensing the concentration of molecules in the environment and 
producing new molecules with desired rates. Molecules diffuse 
from the transmitter to the entire media. It is important to 



note that this type of molecular communication is different 
from other communication schemes where Brownian motion 
of single molecules is studied or molecules are directed toward 
specific directions. 

The rest of the paper is organized as follows: in Sec. II, 
we capture the effect of diffusion by studying the Fick's 
second law of diffusion and show as to how the production of 
molecules by each node affects the concentration level sensed 
by other nodes in the network. In Sec. Ill, the consensus 
problem is presented and formulated in a matrix form. Then, 
in Sec. IV, consensus problem is studied for compact and 
uniform networks and an iterative algorithm is proposed for 
the uniform type. In Sec. V, the convergence of the proposed 
algorithm is verified and the rate of the convergence is dis- 
cussed. Finally, in Sec. VI, the results in the previous sections 
is simulated and verified for a specific network. 

II. Background 

The scheme for a typical molecular communication network 
is depicted in Fig. 1. The communication between the shaded 
nodes in the network is modeled in Fig. 2. Each agent has 
the capability of sensing the concentration of molecules in 
the medium and release molecules at a specific rate back into 
the medium. In other words, each agent in such networks is a 
transceiver. Channel is the medium that molecules are injected 
into and carried depending on the diffusion coefficient of the 
medium. In molecular communication, information is encoded 
into the variations of molecule concentration, e.g., it can be 
in the form of different concentration levels corresponding to 
Amplitude Modulation. 

Two main features that distinguish the consensus problem 
under DbMC from that of the traditional electromagnetic 
communication are the broadcast nature of DbMC and lin- 
gering of the molecules in the shared medium. The molecules 
that are produced by nodes stay in the medium and change 
the concentration sensed by all the other nodes; there is no 
need for specialized routing which makes it fit for in-network 
processing. Hence, the first step is to characterize the temporal 
and spatial variations of molecules in the channel which 
follows the general diffusion equations. According to Fick's 
second law of diffusion, the concentration of molecules c(x, t) 
at position x at time t in an m-dimensional space (i.e. medium) 
is computed using the molecule production rate r(x,t) at the 
source, as follows: 

d -±A=DV 2 c{x,t)+r{x,t). (1) 

Here, x is the distance of any point in the environment from 
the source (assuming the source is at the zero position) and 
D is the diffusion coefficient of the medium. The impulse 
response of (1), corresponding to r(x,t) = S(x)S(t), is the 
Green's function gd(x,t) whose expression is as follows: 

5d(x '* )= (4^)¥ exp ("^)- 

This impulse response is given for an m-dimensional medium. 
Note that diffusion in an m-dimensional space is equivalent to 



m simultaneous 1-D diffusions. Since the diffusion equation 
is a linear equation, the solution to (1) for an arbitrary input 
r(x,t), denoted by c*(x,t), can be calculated using 

c*(x,t) =g d (x,t) ®r(x,t), 

where denotes multi-dimensional convolution operation on 
x and t. 

In our setup, the only molecule producers are nodes. There- 
fore, we have r(x,t) = F(t)5(x), where F(t) is the molecule 
production rate. Hence, we will have 

'™ - f ^ W«'-t))» exp (-15^) ) *• 

(2) 

This response is valid for open free medium where the only 
boundary conditions are at the transmitter. Note that in this 
model, we do not consider the delay due to the travel time of 
molecules between the nodes. That is we assume molecules 
reach the nodes instantly. This does not affect our analysis of 
consensus, because it only shifts the time that molecules are 
received at the nodes. 

III. Problem Statement 

We consider collaborative networks and refer to agents as 
network nodes. The goal is to spread the information about 
an event or any other variation through the network with 
minimum latency. The problem setup is such that each node in 
an Af-node network initially has a measurement value. These 
initial measurement values are assumed to be formed from 
estimating a specific parameter in the environment. These 
estimates are assumed to derive from a random variable. All 
the nodes try to obtain the best estimate for this random 
variable through the communication in the network. Node i 
maps its estimate of the environment parameter into the level 
of molecule concentration pi corresponding to this estimate. 
Network nodes exchange their estimates via producing proper 
molecular concentrations in the medium, to arrive at consen- 
sus. We assume these estimated concentrations are drawn from 
the same distribution iV(/x, 0q), i.e., a Gaussian distribution 
with expected value of p and variance of <Jq. 

The best unbiased estimate for p is the average of initial 
estimates of nodes, i.e. p = p av = jt J2iLi Pi- This estimate 
has variance of which can be arbitrarily small when N 
is large enough. Consensus is reached when estimate of each 
node approaches this value and the variance of their estimate 
approaches We propose an algorithm that ensures all 
nodes to arrive at an estimate for the average consensus in 
finite amount of time. After evaluating the average of these 
concentrations, nodes can map their molecule production rate 
back into the average of the parameter that they measured in 
the environment. 

In this network, nodes communicate with each other through 
diffusing molecules into the environment with different rates. 
These rates of molecule production change the concentration 
of molecules at vicinity of other nodes and convey the infor- 
mation from one node to another. Hence, each node needs 






Fig. 1: Node 1 Diffusing Molecules into a Network of 5 
Nodes. 
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Fig. 2: Model of Molecular Communication from node 1 to 
node 2. 



to choose the best rate based on its estimate and convey 
this estimate to other nodes. Based on received molecule 
concentration and their previous estimate, nodes improve their 
estimates iteratively until they reach a consensus. 

Assume node i has an initial estimate pi. The goal is to 
obtain the average p av = jfY^iLiPity every node. From (1), 
the concentration of molecules Ci(t) in the vicinity of node i 
is given by 

N 

i=i 

where c*j(xij,t) is the response at the position of node i 
due to production by node j. Furthermore, Xij denotes the 
distance between nodes i and j. In its classical communication 
analogue, we have a network of TV transceivers. Transmitter i 
sends the signal Fi (t) to TV receivers through TV different chan- 
nels with the impulse response gd(xij,t), j G {1,2...., TV}. At 
each receiver, the superposition of outputs from TV different 
channels is observed. 

The TV equations above should be optimized with respect 
to Fi(t) to produce the average of the initial estimates in the 
vicinity of each node at a specific time To. We assume node 
i transmits molecules with a constant rate Fi in the interval 
[0, To]. Therefore, at time To, the concentration of molecules 
at node i is given by 

N 

c i (t) = ^ t F j X(i,j), (3) 
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Note that X(i,j) 
network. 

Let F be a vector consisting of the production rate of all 
the nodes. Likewise, let c be the vector of concentration of 
molecules at vicinity of the network nodes. Then, we have 

c = XT. (5) 



In (5), X is a symmetric matrix whose entries X(i,j) are 
calculated according to (4). If we set every entry of c as p av , 
we will get a linear system of equations that can be solved to 
obtain Fi, the i th element of T. If X is invertible, there would 
be a unique solution for F. The important issue here is that 
each agent chooses Fi based solely on its estimate whereas the 
estimates of other nodes are unknown for the agent. Hence, 
Fi in (5) can depend only on pi but the general solution for 
F does not necessarily satisfy this condition. Therefore, we 
propose an iterative solution for reaching the consensus while 
imposing this dependency constraint. 

IV. Proposed Algorithm 

We consider a discrete model of time epochs of length To 
for analyzing the network. The length of an epoch depends 
on the topology of the network. In particular, it depends on 
the effective radius R of each node (which is explained later) 
and the diffusion coefficient of the medium D. We can assume 
T = k+jj where k is a constant. We arbitrarily choose k to be 
equal to one. Although our analysis is valid for other values of 
k. In each time epoch, node i emits molecules with a constant 
rate Fi. At the beginning of each time epoch, each node 
optimizes Fi based on its previous estimate and measurement 
of the molecular concentration in the environment, such that 
the estimate of all nodes becomes closer to p av . Hence, at the 
end of each interval (t = To), the concentration at node i can 
be obtained by (5). 

Note that after each interval, nodes need to wait for a 
specific amount of time, which will be again in the form 
of k^, before releasing the molecules for the next epoch. 
This waiting interval is needed to allow the molecules in the 
environment diffuse away or equivalently to reset the channel. 
For the rest of the paper, an iteration includes both the injection 
of molecules and the waiting intervals. 

Case I: First, we study the special case of compact net- 
works. In such networks, nodes are in vicinity of each other 
and approximately sense the same concentration of molecules 
within the radius of the network, i.e. = Cj, Vi, j. Molecules 
produced by each node would have the same share in making 



up the concentration in the network radius. Therefore, the 
matrix equation in (5) results in a single equation for the 
common concentration of nodes c(Tb) at the center of the 
network at time Tq: 
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c(T ) = J2 
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(6) 



Here, is the distance between the node j and the center of 
the network (note that we choose the center for the notation 
simplification). Furthermore, Xj is the result of integration 
in (6) which depends only on Xj and the network constants. 



Equation (6) implies that by setting Fj 
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equate the concentration c(To) to p av = f=i Pj- 
observe that the previously mentioned dependency constraint 
is taken into account. In this scenario, all the nodes are able 
to observe each other. Hence, by setting appropriate rates, 
they can reach the consensus in one step without any further 
iterations. In the following, we study the more challenging 
case that the network is extended and the concentration of 
molecules differs at the vicinity of each node. In addition, 
nodes may not be able to interact with all the other nodes. In 
this case, the information is conveyed from one side of the 
network to another by nodes acting as relays. 

Case II: We consider a network with nodes that are 
uniformly deployed in an m-dimensional medium. In a suf- 
ficiently large network, each node observes the same relative 
distances to the other nodes in the network (except for the 
nodes on the boundary of the network whose effects are neg- 
ligible when the number of nodes is large enough). Since the 
network is extended, each node has effective communications 
with only a specific number of nodes. To obtain this number, 
we consider a circle of radius r around each node which 
includes the nodes that it can have effective communications. 
The number of nodes at distance r from node i increases 
linearly with r. However, based on (2), the effect they have 

2 

on node i decreases as e~ r . Hence, the collective effect of 

2 

all the nodes at distance r varies as a factor of re~ r which 
approaches zero when r becomes large. By setting a threshold 
e for this factor, we can find R, the effective radius around 
each node. Therefore, the number of nodes TV' that each node 
is able to interact with, is equal to nR 2 d in a 2-D medium, 
where d is density of nodes in the medium. 

Based on the above discussion, the matrix X, will be 
an (TV x TV) symmetric matrix where the columns are the 
permutation of each other. In addition, at each column (or 
equivalently at each row), there are TV' nonzero elements 
whereas the rest of TV — TV' elements are zero. These zero 
elements correspond to the nodes that the node cannot interact 
directly. In this model, the effect of nodes on the boundary 



is disregarded. We note that in a general case, X would be 
a general symmetric matrix and the analysis will be more 
difficult. However, in the uniformly distributed case, we can 
propose an iterative method that ensures the reduction of 
variance in each iteration and convergence to the average 
value. 

Let p(n) be the estimate vector of nodes at epoch n and 
pi(n) be the i th element of the vector. In addition, let c(n) 
denotes the vector of molecule concentrations at the vicinity of 
nodes at epoch n. Assume each node has an initial estimate 
Pi(0). After each epoch, nodes update their estimate based 
on the molecules received from other nodes. Suppose Si to 
be the sum of entries in the i th column of X. Because of the 
isotropic setting, Si is the same for all columns, denoted by S. 
The nodes set their molecule production rate as F(n) = 
(we assume that each node is aware of the location of other 
nodes at its effective radius and hence can compute S). Hence, 
based on (5), after communication among nodes, we will have 



c(n) = X 



p(n) 



(J) 



S S l 

We observe that sum of each row or column in the matrix 
X = * is equal to one. Hence X is a doubly stochastic matrix. 
We assume that each node i updates its estimate by putting 
Pi(n + 1) = Ci(n). Thus, we have an iterative equation for 
estimate of nodes. 

p(n) = Xp(n-l). (8) 
In the following, we examine the convergence of (8). 

V. Convergence Analysis 

First, we verify that the iterative algorithm proposed in (8) 
results in an unbiased estimate for p = E[/^(0)] where p is 
the expected value of the Gaussian distribution that the initial 
estimate is from. We denote by E[p(n)] the vector of expected 
values of the estimates at iteration n. We know that E[p(0)] = 
/il, where 1 is the (TV x 1) vector whose elements are all 
one. Since the matrix X is assumed to be constant during the 
iterations, E[p(n)] is given by 



E[p(n)] = E[Xp(n-l)]=XE[p(n-l)]. 



(9) 



Since the sum of each row of X is equal to zero, XE[p(0)] = 
E[p(0)]. Hence, by continuing the chain in (9), we conclude 
that E[p(n)] = E[p(0)] = pi. This implies that the estimate 
of nodes at each step n is an unbiased estimate for the initial 
parameter. 

In order to study the variance of the estimates in each 
iteration, we need to elaborate on some characteristics of 
the matrix X. This doubly stochastic matrix resembles the 
transition matrix in a Markov chain. We observe that this 
transition matrix is aperiodic because the entries on the main 
diagonal, which shows the effect of each node on its own, are 
nonzero. It is also irreducible because the graph representation 
of this matrix is connected and we can reach other nodes from 
each node. Hence, we use the Perron-Frobenius Theorem [14], 
regarding the eigenvalues (A) of X: 



1) Ai = 1 and it is a singular root, 

2) \X N \ < |Aat-i| • • - < |A 2 | < 1. 

Since the sum of each row is one, the eigenvector corre- 
sponding to Ai is a uniform vector which is the consensus 
vector. The normalized form of this vector is v\ = ^pl- The 
eigenvalue decomposition of X is given by 

X = QAQ- 1 = QAQ T . (10) 

Here, Q is a matrix that whose columns are the eigenvectors of 
X and A is a diagonal matrix containing the eigenvalues of X. 
The result in (10) comes from the fact that X is a symmetric 
matrix. Hence, eigenvectors are orthogonal to each other and 
we have X" 1 = X T . Based on (10), we have X fe = QA k Q T . 
Therefore, the estimate of nodes at iteration n is given by 



p{n) = Xp(n-l) 
= X*p(0) 
= QA n Q T p(0). 



(11) 



Since all the eigenvalues except for Ai = 1 are smaller than 
one, all the entries in the main diagonal of matrix A approaches 
zero except for An = 1. Hence, we have 



lim p(n) 



^ivfp(O) 



(12) 



As we see in (12), after sufficient number of iterations, the 
estimate of nodes approaches to the average of initial beliefs. 
In order to quantify the rate of convergence, we look into 
the variance of estimates of nodes. As mentioned before, the 
smallest variance that can be achieved is which corresponds 
to the average value. The co variance matrix Cov(n) of the 
estimates at iteration n is given by 

Cov(n) = E[(p(n)-E[p(n)])(p(n)-E[p(n)]) T ] 
= X»E[p(0)-E[p(0)]) 

(p(0) - E[(p(0)]) r ] i± T r 

= X n Cov(0)X n . (13) 

where the fact that all powers of a symmetric matrix are also 
symmetric is used. Since the initial estimates are considered 
to be independent of each other, C 'ov (0) = (TqInxN where I 
is the identity matrix. Hence, from (13), Covin) = <JoX 2n . 
We denote the diagonal elements of the matrix Covin) by 
Covuin) which gives the variance of the estimate of each 
node at iteration n. For large n, we consider only the effect 
of A2, the second largest eigenvalue of X. Thus, by using the 
decomposition in (10), we have 



Cov^n) = o-H-+v 2 2i \f 



(14) 



where V2i denotes the i th element of eigenvector V2 corre- 
sponding to A2 . Since the norm of eigenvector V2 is one, then 
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Fig. 3: Convergence of the iterative algorithm versus the 
number of nodes in the network. 

V21 < 1, i £ 1, 2...., N. Hence, the variance of the estimate of 
node i at iteration n is obtained as 



i e 1,2...., A/". 



(15) 



By deploying more iterations, the upper-bound in (15) can 
become arbitrarily close to which approaches to zero in a 
network with large number of nodes. It is obvious that smaller 
A2 will result in a faster convergence. It can be proved that 
when matrix X becomes more sparse, i.e. the columns and the 
rows contain more zeros, A2 becomes closer to one. This can 
be explained by the fact that a more sparse X means a more 
extended network. Hence, more number of relays are needed 
to spread the information in the network and hence, reaching 
consensus will be more time consuming. In particular when 
X becomes a diagonal matrix, A2 is equal to one and of does 
not converge to ^cr 2 . This case is equivalent to the scenario 
in which none of the nodes are able to communicate with each 
other and their initial beliefs cannot be improved. 

VI. Case Study 

In this section, we present an example to verify our results 
in the previous section. We assume a network of N nodes 
that are placed on a line where the distance between two 
successive nodes is a constant a. This network would satisfy 
the uniform network condition in Sec. IV as long as this line 
extends to infinity in both ends. For a realistic situation, we 
assume N to be finite. We assume nodes communicate with 
each other through 2-D DbMC in the medium and the size of 
nodes is small such that it does not interfere with diffusion of 
molecules. 

We compute the elements of matrix X based on (4) and by 
normalizing the columns, we obtain X. Since, the boundary 
nodes are taken into account, the symmetric matrix of X would 
not be doubly stochastic. We compute the variance of columns 
of powers of X and verify the convergence of the matrix 
that governs the convergence of the estimates. The results for 



variant TV and fixed effective radius, i.e. fixed TV', is shown in 

2 

Fig. 3. We assumed To to be and N' equal to 5. In this plot, 
sum of the variance of columns is depicted versus the number 
of iterations. As we observe in the graph, as TV becomes larger, 
X becomes more sparse and the rate of convergence decreases 
since the second largest eigenvalue becomes larger. 

VII. Conclusion 

In this paper, we studied the consensus problem in a network 
that employs DbMC for communication between nodes. We 
first considered the simple case of compact networks and then 
extended the concept to more general distributed networks. In 
particular, we considered a network with uniformly distributed 
nodes in the medium. Then, we developed a protocol as to how 
each node should behave to arrive at an unbiased estimate. The 
resulting protocol is an iterative scheme which the variance of 
the estimates can be reduced arbitrarily. Future works include 
the general case of networks and the case that nodes do 
not have complete information about the other nodes in their 
vicinity. 
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